Abstract

We present an algorithm to approximate the solutions to variational problems where the set of
admissible functions consists of convex functions. The main motivation behind this numerical
method is estimating solutions to Adverse Selection problems within a Principal-Agent framework.
Problems such as product line design, optimal taxation and structured derivatives design
can be studied through these models. We develop a method to estimate their
optimal pricing schedules.
1 Introduction

Arguably, Newton's problem of the body of minimal resistance is the original variational problem
with convexity constraints. It consists of finding the shape of a solid that encounters the least
resistance when moving through a fluid. This is equivalent to finding a convex function from a
convex domain (originally a disk) in $\mathbb{R}^2$ to $\mathbb{R}$ that minimizes a certain functional (see Section 2).
Newton's original "solution" assumed radial symmetry. This assumption turned out to be false, as shown by
Brock, Ferone and Kawohl in [3], which sparked new interest in the study of variational problems
with convexity constraints. One can also find these kinds of problems in finance and economics.
Starting in 1978 with the seminal paper of Mussa and Rosen [16], the study of non-linear pricing
as a means of market screening under Adverse Selection has produced a considerable stream of
contributions (see [5] and [17], among others). In models where goods are described by a single quality and the set of
agents is differentiated by a single parameter, it is in general possible to find closed form solutions
for the pricing schedule. This is, however, not the case when multidimensional consumption
bundles and agent types are considered. Although Rochet and Chone [17] provided conditions
for the existence of an optimal pricing rule and fully characterized the ways in which markets
differentiate in a multidimensional setting, they also pointed out that it is only in very special
cases that one can expect to find closed form solutions. The same holds true for models where the
set of goods lies in an infinite-dimensional space, even when agent types are one-dimensional. This
framework was first used, to our knowledge, by Carlier, Ekeland and Touzi [5] to price financial
derivatives traded "over-the-counter". It was then extended by Horst and Moreno [11] to model
the actions of a monopolist who has an initial risky position that she evaluates via a coherent risk
measure, and who intends to transfer part of her risk to a set of heterogeneous agents. In both
cases the authors find that only very restrictive examples allow for explicit solutions.

Given that a great variety of problems, such as product line design, optimal taxation and
structured derivatives design, can be studied through these models, there is a clear
need for robust and efficient numerical methods that approximate their optimal pricing schedules.
Note that this also provides an approximation of the optimal "products". Most of the papers
mentioned above eventually face solving a variational problem under convexity constraints. This family
of problems has lately been studied from different angles. Carlier and Lachand-Robert [6] have
studied the $C^1$ regularity of minimizers when the functional is elliptic and the admissible functions
satisfy a Dirichlet-type boundary condition. Their results can be extended to some of our examples.
Lachand-Robert and Peletier [15] characterize the extreme points of a functional depending only
on $\nabla f$ over a set of convex functions with uniform convex bounds. In this paper we provide
several variants of an algorithm, based on the idea of approximating a convex function by an affine
envelope, to solve these types of problems. This deviates from previous work by Chone and Herve
[8] and Carlier, Lachand-Robert and Maury [7], where the authors use finite element methods.
In the former case, a conformal (interior) method is used and a non-convergence result is given.
As a consequence, the latter uses an exterior approximation method, which is indeed found to
be convergent in the classical projection problem in $H_0^1$. Lachand-Robert and Oudet present in
[14] an algorithm for minimizing functionals within convex bodies that shares some similarities
with ours. For a particular problem, they start with an admissible polytope and iteratively modify
the normals to the facets in order to find an approximate minimizer.

We estimate the minimizers for several problems with known, closed form solutions as a means
of comparing the output of our method to the true solutions. These are taken from [17] and [5].
Finally, we provide an example in which we approximate the solution to a risk-minimization
problem similar to the one presented in [11]. This is still based on the affine-envelope idea,
but requires some additional methodology, since it involves solving a non-standard variational
problem.

The remainder of this paper is organized as follows. In Section 2 we state our problem and
provide some classical examples. Our algorithm and a proof of its convergence are presented in
Section 3. In Section 4 we show the solutions obtained via our algorithm to several problems
found in the literature. Since these problems share a common microeconomic motivation, we
include a brief discussion of the latter. The examples include the well known "Rochet-Chone"
problem, a one-dimensional example from Carlier, Ekeland and Touzi, and the risk transfer case
for a principal who offers call options with type-dependent strikes and evaluates her risk via the
"short fall" of her position. This section is followed by our conclusions. Finally, a section devoted
to technical results and all our codes are included in the appendix.



2 Setting

The aim of this paper is to present a numerical algorithm to approximate the solutions of some
variational problems subject to convexity constraints. A classical example of the latter is Newton's
problem of the body of minimal resistance, which, given $B$ a smooth subset of $\mathbb{R}^2$, consists of
minimizing

$$I[f] := \int_B \frac{d\theta}{1 + |\nabla f(\theta)|^2}$$

over the set of convex functions $\{f : B \to \mathbb{R}\}$. We use the following notation throughout:

• $\Theta, Q \subset \mathbb{R}^n$ are convex and compact sets,

• $L(\theta, z, p) = z - \theta \cdot p + C(p)$, where $C$ is strictly convex and $C^1$,

• $\mathcal{C} := \{f : \Theta \to \mathbb{R} \mid f \ge 0,\ f \text{ is convex, and } \nabla f \in Q \text{ a.e.}\}$,

• $I[f] := \int_\Theta L(\theta, f(\theta), \nabla f(\theta))\,d\theta$.
Our objective is to (numerically) estimate the solution to

$$\mathcal{P} := \inf_{f \in \mathcal{C}} I[f] \tag{1}$$

We assume $C$ is such that (1) has a unique solution (see, for example, [13]). Given the properties
of $L$ and $\mathcal{C}$ we immediately have the following

Proposition 2.1 Assume $v$ solves $\mathcal{P}$, then there is $\theta_0$ in $\Theta$ such that $v(\theta_0) = 0$.

Proof. Let $v_0 = \min_{\theta \in \Theta} v(\theta)$ (recall $\Theta$ is compact) and define $u(\theta) := v(\theta) - v_0$, then

$$I[u] = \int_\Theta \big(u(\theta) - \theta \cdot \nabla u(\theta) + C(\nabla u(\theta))\big)\,d\theta = I[v] - \|\Theta\|\, v_0.$$

This would contradict the hypothesis of $v$ being a minimizer of $I$ over $\mathcal{C}$ unless $v_0 = 0$. □

It follows from Proposition 2.1 that we can redefine $\mathcal{C}$ to include only functions that have a
root in $\Theta$. This, together with the compactness of $Q$, implies the following proposition, which we
will use frequently.

Proposition 2.2 There exists $0 < K < \infty$ such that $v \le K$ for all $v$ in $\mathcal{C}$.

It follows from Proposition 2.2 and the restriction on the gradients that for each choice of function
$C$, problem $\mathcal{P}$ has a unique solution, since the functional $I$ will be strictly convex and lower
semi-continuous, and the admissible set is bounded (see [9]).

Remark 2.3 Our algorithm will still work with more general $L$'s as long as one can prove that
the family of feasible minimizers is uniformly bounded.



3 Description of the Algorithm 

From this point on, whenever we use superscripts we refer to vectors. For example $V^k = (V_1^k, \dots, V_n^k)$.
On the other hand, a subscript indicates a function to be evaluated over some closed, convex subset
of $\mathbb{R}^n$ of non-empty interior, i.e., $\{V_k\}$ is a sequence of functions $V_k : X \to \mathbb{R}$ for some $X$ contained
in $\mathbb{R}^n$.

Assumption 3.1 We will consider $\Theta = [a, b]^n$.

To find an approximate solution to $\mathcal{P}$, we proceed as follows:

1. We discretize the domain $\Theta$ in the following way: we partition it into $\Sigma_k$, which consists
of $k^n$ equal cubes of volume $\|\Sigma_k\| := \left(\frac{b-a}{k}\right)^n$. The elements of $\Sigma_k$ will be denoted by $\sigma_j^k$,
$1 \le j \le k^n$. Now define $\Theta_k$ as the set of centers of the $\sigma_j^k$'s. The elements of $\Theta_k$ will be
denoted by $\theta_j^k$. The choice of a uniform partition is made for computational simplicity.

2. We denote $f_i = f(\theta_i^k)$ and associate this weight with $\theta_i^k$.

3. We associate to each element $\theta_i^k$ of $\Theta_k$ a non-negative number $v_i^k$ and an $n$-dimensional
vector $D_i^k$. The former represents the value of $v(\theta_i^k)$ and the latter $\nabla v(\theta_i^k)$.

4. We solve the (non-linear) program

$$\mathcal{P}_k := \inf\ \|\Sigma_k\| \sum_{i=1}^{k^n} L(\theta_i^k, v_i, D_i)\, f_i \tag{2}$$

over the set of all vectors of the form $v = (v_1, \dots, v_{k^n})$ and all matrices of the form
$D = (D_1, \dots, D_{k^n})$ such that:

(a) $v \ge 0$ (non-negativity),

(b) $D_i \in Q$ for $i = 1, \dots, k^n$ (feasibility),

(c) $v_i - v_j + D_i \cdot (\theta_j - \theta_i) \le 0$ for all $i, j$ (convexity).

If the problem at hand includes Dirichlet boundary conditions, these can be included here
as linear constraints that the $D_i$'s corresponding to points on the "boundary" of $\Theta_k$ must
satisfy.

5. Let $(v^k, D^k)$ be the solution to $\mathcal{P}_k$. We define $v_k(\theta) := \max_i p_i(\theta)$, where

$$p_i(\theta) = v_i^k + D_i^k \cdot (\theta - \theta_i).$$

6. $v_k$ yields an approximation to the minimizer of $\mathcal{P}$.
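The lattice of step 1, the convexity constraint 4(c) and the envelope of step 5 can be sketched in a few lines. This is only an illustration under stated assumptions, not the authors' MatLab implementation: the helper names (`cell_centers`, `satisfies_convexity`, `envelope`) are hypothetical, and instead of solving the program the sketch plugs in the known convex function $v(\theta) = \|\theta\|^2/2$ on $[1,2]^2$, whose gradient is $\theta$.

```python
from itertools import product

def cell_centers(a, b, k, n):
    """Centers of the k**n equal cubes partitioning [a, b]**n (step 1)."""
    h = (b - a) / k
    axis = [a + (i + 0.5) * h for i in range(k)]
    return [tuple(p) for p in product(axis, repeat=n)]

def dot(x, y):
    return sum(xi * yi for xi, yi in zip(x, y))

def satisfies_convexity(theta, v, D, tol=1e-12):
    """Constraint 4(c): v_i - v_j + D_i . (theta_j - theta_i) <= 0 for all i, j."""
    for ti, vi, Di in zip(theta, v, D):
        for tj, vj in zip(theta, v):
            diff = tuple(x - y for x, y in zip(tj, ti))
            if vi - vj + dot(Di, diff) > tol:
                return False
    return True

def envelope(theta, v, D, point):
    """Step 5: v_k(point) = max_i [ v_i + D_i . (point - theta_i) ]."""
    return max(vi + dot(Di, tuple(x - y for x, y in zip(point, ti)))
               for ti, vi, Di in zip(theta, v, D))

# Illustration (assumed data): v(theta) = |theta|^2 / 2 on [1, 2]^2.
theta = cell_centers(1.0, 2.0, 4, 2)
v = [0.5 * dot(t, t) for t in theta]
D = [t for t in theta]
```

For this choice the left-hand side of 4(c) equals $-\tfrac12\|\theta_i - \theta_j\|^2$, so the constraint holds, and the envelope reproduces $v_i$ at every lattice point. Note that the number of convexity constraints grows like $k^{2n}$, which is the main cost of the approach.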

Remark 3.2 The constraints of the non-linear program determine a convex set.

Remark 3.3 Constraint 4(c) guarantees that $p_i$ is a supporting hyperplane of the convex hull of the points
$\{(\theta_1, v_1), \dots, (\theta_{k^n}, v_{k^n})\}$. Note that $v_k$ is a piecewise affine convex function.

3.1 Convergence of the Algorithm 

Proposition 3.4 Under the assumptions made on $L$, the problem $\mathcal{P}_k$ has a unique solution.

Proof. The function

$$J_k(v^k, D^k) := \|\Sigma_k\| \sum_{i=1}^{k^n} \big(v_i^k - \theta_i \cdot D_i^k + C(D_i^k)\big)$$

is strictly convex. It follows from Proposition 2.2 that any acceptable vector-matrix pair
must lie in $[0, K]^{k^n} \times Q^{k^n}$, which together with Remark 3.2 implies that $\mathcal{P}_k$ consists of minimizing a
strictly convex function over a compact and convex set. The result then follows from general
theory. □

Proposition 3.5 There exists $v \in \mathcal{C}$ such that:

1. The sequence $\{v_k\}$ generated by the $\mathcal{P}_k$'s has a subsequence $\{v_{k_j}\}$ that converges uniformly
to $v$.

2. $\lim_{j \to \infty} I[v_{k_j}] = I[v]$.

Proof. The bounded (Proposition 2.2) family $\{v_k\}$ is uniformly equicontinuous, as it consists of
convex functions with uniformly bounded subgradients. By the Arzela-Ascoli theorem we have
that, passing to a subsequence if necessary, there is a non-negative and convex function $v$ such
that

$$v_k \to v \quad \text{uniformly on } \Theta.$$

By convexity $\nabla v_k \to \nabla v$ almost everywhere (Proposition A.5); since $\nabla v_k(\theta)$ belongs to the bounded
set $Q$, the integrands are dominated. Therefore, by Lebesgue Dominated Convergence we have

$$\lim_{k \to \infty} I[v_k] = I[v].$$

□

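Let $\bar{u}$ be the minimizer of $I[\cdot]$ within $\mathcal{C}$. Our aim is to show that $\{v_k\}$ is a minimizing sequence
of problem $\mathcal{P}$, in other words that

$$\lim_{k \to \infty} I[v_k] = I[\bar{u}].$$

We need the following

Definition 3.6 Let $\bar{u}$ be such that $\inf_{u \in \mathcal{C}} I[u] = I[\bar{u}]$. Given the lattice $\Theta_k$, we define:

1. $u_i^k := \bar{u}(\theta_i)$,

2. $G_i^k := \nabla \bar{u}(\theta_i)$,

3. $q_i(\theta) := u_i^k + G_i^k \cdot (\theta - \theta_i)$ and

4. $u_k(\theta) := \sup_i q_i(\theta)$.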
Notice that $u_k(\theta)$ is also constructed as the convex envelope of a family of affine functions.
The inequalities

$$J_k(u^k, G^k) \ge J_k(v^k, D^k) \tag{3}$$

$$I[v_k] \ge I[\bar{u}] \tag{4}$$

follow from the definitions of $J_k(v^k, D^k)$, $v_k$ and $u_k$, as does the following

Proposition 3.7 Let $\bar{u}$ and $u_k$ be as above, then $u_k \to \bar{u}$ uniformly as $k \to \infty$.
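The uniform convergence of the tangent envelope to the function it is built from can be observed numerically. The sketch below is an illustration under assumed data, not part of the original argument: it takes $u(\theta) = \theta^2$ on $[0,1]$ (a hypothetical stand-in for the minimizer), builds the envelope of the tangents at the $k$ cell centers, and measures the sup-norm gap on a sampling grid.

```python
def tangent_envelope_error(k, samples=1000):
    """Sup-norm gap between u(t) = t**2 on [0, 1] and the max of its
    tangents taken at the k cell centers t_i = (i + 0.5) / k."""
    centers = [(i + 0.5) / k for i in range(k)]
    worst = 0.0
    for s in range(samples + 1):
        t = s / samples
        # q_i(t) = u(c) + u'(c) * (t - c); the envelope is the max over c.
        u_k = max(c * c + 2.0 * c * (t - c) for c in centers)
        worst = max(worst, t * t - u_k)
    return worst

errors = {k: tangent_envelope_error(k) for k in (4, 8, 16)}
```

For this particular $u$ the gap at a point $t$ is the squared distance from $t$ to the nearest center, so the error is $1/(4k^2)$ and is attained at the endpoints of the interval. The envelope always lies below $u$, since each tangent of a convex function is a minorant.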
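Proposition 3.8 For each $k$ there exist $\epsilon_1(k)$ and $\epsilon_2(k)$ such that

$$\left| J_k(v^k, D^k) - I[v_k] \right| \le \epsilon_1(k) \tag{5}$$

$$\left| J_k(u^k, G^k) - I[u_k] \right| \le \epsilon_2(k) \tag{6}$$

and $\epsilon_1(k), \epsilon_2(k) \to 0$ as $k \to \infty$.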

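Proof. We will show (5) holds; the proof for (6) is analogous. Define the simple function

$$w_k(\theta) := L(\theta_j, v_j^k, D_j^k), \quad \theta \in \sigma_j^k, \tag{7}$$

hence

$$J_k(v^k, D^k) = \int_\Theta w_k(\theta)\,d\theta.$$

The left-hand side of (5) can be written as

$$\left| \int_\Theta w_k(\theta)\,d\theta - I[v_k] \right| \tag{8}$$

It follows from Lemma A.7 that there exists $\epsilon_1(k)$ such that

$$\left| \int_\Theta w_k(\theta)\,d\theta - I[v_k] \right| \le \epsilon_1(k)$$

and

$$\epsilon_1(k) \to 0 \quad \text{as } k \to \infty.$$

□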
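The gap between the cell-center sum and the integral, which the proposition controls, can be illustrated in one dimension. The following sketch rests on assumptions that are not in the original: $\Theta = [0,1]$, $C(p) = p^2/2$, and the fixed convex function $v(\theta) = \theta^2$ in place of the solution of the discrete program. With these choices $L(\theta, v, v') = \theta^2$ and the exact integral is $1/3$.

```python
def L(theta, z, p):
    """Integrand L(theta, z, p) = z - theta * p + C(p) with C(p) = p**2 / 2."""
    return z - theta * p + 0.5 * p * p

def riemann_sum(k):
    """Cell volume times the sum of L at the k cell centers,
    for v(theta) = theta**2 and v'(theta) = 2 * theta on [0, 1]."""
    h = 1.0 / k
    centers = [(i + 0.5) * h for i in range(k)]
    return h * sum(L(t, t * t, 2.0 * t) for t in centers)

EXACT = 1.0 / 3.0  # integral of theta**2 over [0, 1]
gaps = {k: abs(riemann_sum(k) - EXACT) for k in (5, 10, 20, 40)}
```

In this smooth example the gap is $1/(12k^2)$, the usual midpoint-rule error for a quadratic. Lemma A.7 only guarantees that the gap vanishes as $k \to \infty$, without a rate, because the gradients of the approximating functions are merely defined almost everywhere.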



We can now prove the main theorem in this section, namely

Theorem 3.9 The sequence $\{v_k\}$ is minimizing for problem $\mathcal{P}$.

Proof. It follows from Proposition 3.8 and equation (3) that

$$I[u_k] + \epsilon_2(k) + \epsilon_1(k) \ge I[v_k] \ge I[\bar{u}] \tag{9}$$

Letting $k \to \infty$ in (9) and using Proposition 3.7 yields the desired result.

□



4 Examples 

In this section we show some results of implementing our algorithm. The first two examples reduce
to quadratic programs, whereas the third and fourth ones are non-linear optimization programs.
All the code has been written in MatLab. However, in both cases supplemental
optimization toolboxes were used. In the first two examples we used the Mosek 4.0 Optimization
Toolbox, whereas in the last two we used Tomlab 6.0. These four examples share a common
microeconomic motivation, for which we provide an overview. We refer the interested reader to
[2] for a comprehensive presentation of Principal-Agent models and Adverse Selection, as well as
multiple references.

4.1 Some Microeconomic Motivation 

Consider an economy with a single principal and a continuum of agents. The latter's preferences
are characterized by $n$-dimensional vectors. These are called the agents' types. The set
of all types will be denoted by $\Theta \subset \mathbb{R}^n$. The individual types $\theta$ are private information, but the
principal knows their statistical distribution, which has a (non-atomic) density $f(\theta)$.

Our model takes a hedonic approach to product differentiation. We assume goods are
characterized by ($n$-dimensional) vectors describing their utility-bearing attributes. The set of
technologically feasible goods that the principal can deliver will be denoted by $Q \subset \mathbb{R}^n$, and it
will be assumed to be compact and convex. The cost to the principal of producing one unit of
product $p$ is denoted by $C(p)$. Products are offered on a take-it-or-leave-it basis, each agent can
buy one or zero units of a single product $p$, and it is assumed there is no second-hand market.
The (type-dependent) preferences of the agents are represented by the function

$$U : \Theta \times Q \to \mathbb{R}.$$

The (non-linear) price schedule for the technologically feasible goods is represented by

$$\pi : Q \to \mathbb{R}.$$

When purchasing good $q$ at a price $\pi(q)$ an agent of type $\theta$ has net utility

$$U(\theta, q) - \pi(q).$$

Each agent solves the problem

$$\max_{q \in Q} \{U(\theta, q) - \pi(q)\}.$$

By analyzing the choice of each agent type under a given price schedule $\pi$, the principal screens
the market. Let

$$v(\theta) := U(\theta, q(\theta)) - \pi(q(\theta)), \tag{10}$$

where $q(\theta)$ belongs to $\operatorname{argmax}_{q \in Q} \{U(\theta, q) - \pi(q)\}$. Notice that for all $q$ in $Q$ we have

$$v(\theta) \ge U(\theta, q) - \pi(q) \tag{11}$$

Analogous to the concepts of subdifferential and convex conjugate from classical Convex
Analysis, we have that the subset of $Q$ where (11) is an equality is called the $U$-subdifferential
of $v$ at $\theta$, and $v$ is the $U$-conjugate of $\pi$ (see, for example, [1]). We write

$$v(\theta) = \pi^U(\theta)$$

and

$$\partial_U v(\theta) := \{q \in Q \mid \pi^U(\theta) + \pi(q) = U(\theta, q)\}.$$

To simplify notation let $\pi(q(\theta)) = \pi(\theta)$. A single pair $(q(\theta), \pi(\theta))$ is called a contract, whereas
$\{(q(\theta), \pi(\theta))\}_{\theta \in \Theta}$ is called a catalogue. A catalogue is called incentive compatible if $v(\theta) \ge
v_0(\theta)$ for all $\theta \in \Theta$, where $v_0(\theta)$ is type $\theta$'s non-participation (or reservation) utility. We normalize
the reservation utility of all agents to zero, and assume there is always an outside option $q_0$
that denotes non-participation. Therefore we will only consider functions $v \ge 0$. The Principal's
aim is to devise a pricing function $\pi : Q \to \mathbb{R}$ so as to maximize her income

$$\int_\Theta \big(\pi(\theta) - C(q(\theta))\big) f(\theta)\,d\theta \tag{12}$$

Inserting (10) into (12) we get the alternate representation

$$\int_\Theta \big(U(\theta, q(\theta)) - v(\theta) - C(q(\theta))\big) f(\theta)\,d\theta. \tag{13}$$

Expression (13) is to be maximized over all pairs $(v, q)$ such that $v$ is $U$-convex and non-negative
and $q(\theta) \in \partial_U v(\theta)$. Characterizing $\partial_U v(\theta)$ in a way that makes the problem tractable can be
quite challenging. In the case where $U(\theta, q(\theta)) = \theta \cdot q(\theta)$, as in [17], for a given price schedule
$\pi : Q \to \mathbb{R}$, the maximal net utility of an agent of type $\theta$ is

$$v(\theta) := \max_{q \in Q} \{\theta \cdot q - \pi(q)\} \tag{14}$$
Since $v$ is defined as the supremum of its affine minorants, it is a convex function of the types. It
follows from the Envelope Theorem that the maximum in equation (14) is attained if $q(\theta) = \nabla v(\theta)$,
and we may write

$$v(\theta) = \theta \cdot \nabla v(\theta) - \pi(\nabla v(\theta)). \tag{15}$$
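The envelope relation between the net utility and the chosen good can be checked numerically in one dimension. The sketch below is illustrative only: the price schedule $\pi(q) = q^2/2$, the grid standing in for $Q = [0, 3]$ and the type grid on $[1, 2]$ are all hypothetical choices, and `net_utility` is a helper name introduced here.

```python
def net_utility(theta, price, goods):
    """Return (v(theta), q(theta)), where v(theta) = max_q {theta * q - price(q)}
    over a finite grid of goods and q(theta) is a maximizer."""
    best_q = max(goods, key=lambda q: theta * q - price(q))
    return theta * best_q - price(best_q), best_q

goods = [i / 100 for i in range(0, 301)]         # hypothetical grid for Q = [0, 3]
price = lambda q: 0.5 * q * q                    # hypothetical price schedule
types = [1.0 + i / 100 for i in range(0, 101)]   # types in [1, 2]

v = [net_utility(t, price, goods)[0] for t in types]
q = [net_utility(t, price, goods)[1] for t in types]
```

With this price schedule the maximizer is $q(\theta) = \theta$ and $v(\theta) = \theta^2/2$, so a central difference of `v` recovers `q`, and the second differences of `v` are non-negative, in line with the convexity of $v$.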
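The principal's aggregate surplus is given by

$$\int_\Theta \big(\pi(q(\theta)) - C(q(\theta))\big) f(\theta)\,d\theta. \tag{16}$$

Inserting (15) into (16) we get that the principal's objective is to maximize

$$I[v] := \int_\Theta \big(\theta \cdot \nabla v(\theta) - C(\nabla v(\theta)) - v(\theta)\big) f(\theta)\,d\theta \tag{17}$$

over the set

$$\mathcal{C} := \{v : \Theta \to \mathbb{R} \mid v \text{ convex},\ v \ge 0,\ \nabla v(\theta) \in Q\}.$$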

4.2 The Mussa-Rosen Problem in a Square

The following structures are shared by the first two examples:

• $x = (v^k, D^k)$. This structure determines any possible candidate for a minimizer of $J_k(\cdot, \cdot)$
in the following way: $v^k$ is a vector of length $k^2$ that contains the approximate values
of the optimal function $v$ evaluated at the points of the lattice. The vector $D^k$ has length
$2k^2$ and contains what will be the partial derivatives of $v$ at the same points.

• $h$ is a vector of length $3k^2$. The product $hx$ provides the discretization of the integral
$\int_\Theta (\theta \cdot \nabla v(\theta) - v(\theta)) f(\theta)\,d\theta$.

• $B$ is the matrix of constraints. The inequality $Bx \le 0$ imposes the non-negativity of $v$ and
$D$ and the convexity of the resulting $v_k$.

Remark 4.1 The density $f(\theta)$ is "built into" the vector $h$ and the cost function $C$.

Let $\Theta = [1, 2]^2$, $C(q) = \frac{1}{2}\|q\|^2$, and assume the types are uniformly distributed. This is our
benchmark problem, since the solution to the principal's problem can be found explicitly [17].
In this case we have to solve the quadratic program

$$\sup_x\ hx - \tfrac{1}{2} x^t H x \quad \text{subject to} \quad Bx \le 0.$$

$H$ is a $(3k^2) \times (3k^2)$ matrix whose first $k^2$ columns are zero, since $v$ does not enter the cost
function; the four $k^2 \times k^2$ blocks towards its lower right corner form a $(2k^2) \times (2k^2)$ identity
matrix. Therefore $\frac{1}{2} x^t H x$ is a discretization of $\frac{1}{2}\int_\Theta \|\nabla v(\theta)\|^2\,d\theta$. Figure 1(a) was produced using a
17 × 17-point lattice and a uniform density, whereas Figure 1(b) shows the traded qualities.
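To make the roles of $x$, $h$, $H$ and $B$ concrete, the sketch below assembles them for a small lattice in plain Python. It is a minimal illustration, not the authors' MatLab/Mosek code: the function names `build_qp` and `objective` are hypothetical, $H$ is stored only through its diagonal, and the cell weight `w` (cell volume times the uniform density) is folded into both $h$ and $H$, which is one possible reading of Remark 4.1.

```python
from itertools import product

def build_qp(k, a=1.0, b=2.0):
    """Assemble h, the diagonal of H, and B for x = (v, D1, D2) on the
    k x k lattice of cell centers of [a, b]^2 with a uniform density."""
    step = (b - a) / k
    axis = [a + (i + 0.5) * step for i in range(k)]
    pts = list(product(axis, repeat=2))
    n = len(pts)                      # n = k**2 lattice points
    w = step * step / (b - a) ** 2    # cell volume times uniform density
    # Linear part: discretizes the integral of (theta . grad v - v) f.
    h = [-w] * n + [w * p[0] for p in pts] + [w * p[1] for p in pts]
    # Quadratic part: zero for the v-block, weight w on the D-block.
    H_diag = [0.0] * n + [w] * (2 * n)
    B = []
    for i in range(3 * n):            # non-negativity: -x_i <= 0
        row = [0.0] * (3 * n)
        row[i] = -1.0
        B.append(row)
    for i, j in product(range(n), repeat=2):
        if i == j:
            continue
        # Convexity: v_i - v_j + D_i . (theta_j - theta_i) <= 0.
        row = [0.0] * (3 * n)
        row[i], row[j] = 1.0, -1.0
        row[n + i] = pts[j][0] - pts[i][0]
        row[2 * n + i] = pts[j][1] - pts[i][1]
        B.append(row)
    return pts, h, H_diag, B

def objective(x, h, H_diag):
    """h x - (1/2) x^T H x for a diagonal H."""
    return sum(hi * xi for hi, xi in zip(h, x)) - 0.5 * sum(
        d * xi * xi for d, xi in zip(H_diag, x))

pts, h, H_diag, B = build_qp(3)
# Feasible test point (not the optimum): v = |theta|^2 / 2 with gradient D = theta.
x = ([0.5 * (p[0] ** 2 + p[1] ** 2) for p in pts]
     + [p[0] for p in pts] + [p[1] for p in pts])
```

For $k = 3$ this gives 27 unknowns and $27 + 72 = 99$ constraint rows. On a 17 × 17 lattice the same construction yields roughly $289 \cdot 288 \approx 83{,}000$ convexity rows, which is why a sparse representation of $B$ would be preferable in practice.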



(a) The optimal function v. (b) The qualities traded.

Figure 1: Optimal solution for uniformly distributed agent types.



4.3 The Mussa-Rosen Problem with a Non-uniform Density

We keep the cost function of the previous example, but now assume the types are distributed
according to a bivariate normal distribution with mean (1.9, 1) and variance-covariance matrix

$$\begin{pmatrix} 0.3 & 0.2 \\ 0.2 & 0.3 \end{pmatrix}.$$

As noted before, the weight assigned to each agent type is built into $h$ and $H$, so the
vector $x$ remains unchanged. We obtain Figure 2(a).
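The weights that enter $h$ and $H$ in this example come from evaluating the normal density at the lattice points. The following sketch shows one way to compute them with the standard library only. It assumes the type set is still $[1,2]^2$ with a 17 × 17 lattice as in the previous example, and it renormalizes the weights to sum to one over the lattice; both are assumptions of this illustration, and the function names are hypothetical.

```python
import math

def bivariate_normal_pdf(x, y, mean=(1.9, 1.0), cov=((0.3, 0.2), (0.2, 0.3))):
    """Density of a bivariate normal with the mean and covariance of Section 4.3."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    dx, dy = x - mean[0], y - mean[1]
    # Quadratic form (x - m)^T cov^{-1} (x - m) using the explicit 2x2 inverse.
    quad = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return math.exp(-0.5 * quad) / (2.0 * math.pi * math.sqrt(det))

def lattice_weights(k, lo=1.0, hi=2.0):
    """Weights f_i at the k x k cell centers of [lo, hi]^2, renormalized so
    that they sum to one over the lattice (an assumption of this sketch)."""
    step = (hi - lo) / k
    axis = [lo + (i + 0.5) * step for i in range(k)]
    raw = [bivariate_normal_pdf(x, y) for x in axis for y in axis]
    total = sum(raw)
    return [r / total for r in raw]

weights = lattice_weights(17)
```

Because the mean lies close to the edge $\theta_2 = 1$ of the square, the weights are concentrated near that edge, which is the skewness referred to in the discussion of the results.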



Remark 4.2 It is interesting to see that in this case bunching of the second kind, as described
by Rochet and Chone in [17], appears to be eliminated as a consequence of the skewed distribution
of the agents. This can be seen in the non-linear level curves of the optimizing function v. This
is also quite evident in the plot of the qualities traded, which is shown below.

(a) The optimal function v. (b) The qualities traded.

Figure 2: Optimal solution for normally distributed agent types.

The MatLab programs for the two previous examples were run on MatLab 7.0.1.24704 (R14)
on a Sun Fire V480 (4x1.2 GHz Ultra III, 16 GB RAM) computer running Solaris 2.10 OS. In the
first example 57.7085 seconds of processing time were required. The running time in the second
example was 81.7280 seconds.

4.4 An Example with Non-quadratic Cost

In this example we approximate a solution to the problem of a principal who is selling
over-the-counter financial derivatives to a set of heterogeneous agents. This model is presented by Carlier,
Ekeland and Touzi in [5]. They start with a standard probability space $(\Omega, \mathcal{F}, P)$, and the types
of the agents are given by their risk aversion coefficients under the assumption of mean-variance
utilities; namely, the set of agent types is $\Theta = [0, 1]$, and the utility of an agent of type $\theta$ when
facing product $X$ is

$$U(\theta, X) = E[X] - \theta\,\mathrm{Var}[X].$$

Under the assumptions of a zero risk-free rate and that the principal has access to a complete
market, her cost of delivering product $X(\theta)$ is given by $-\sqrt{-\xi v'(\theta)}$, where $\xi$ is the variance of
the Radon-Nikodym derivative of the (unique) martingale measure, and $\mathrm{Var}[X(\theta)] = -v'(\theta)$. The
principal's problem can be written as

$$\sup_{v \in \mathcal{C}} \int_\Theta \left(\theta v'(\theta) + \sqrt{-\xi v'(\theta)} - v(\theta)\right) d\theta \tag{18}$$

where $\mathcal{C} := \{v : \Theta \to \mathbb{R} \mid v \text{ convex},\ v \ge 0,\ v' \le 0 \text{ and } \mathrm{Var}[X(\theta)] = -v'(\theta)\}$. Figure 3 shows an
approximation of the maximizing $v$ using 25 agent types.

Figure 3: The optimal function v.



4.5 Minimizing Risk

The microeconomic motivation for this section is the model of Horst & Moreno [11]. We present
an overview for completeness. The principal's income, which is exposed to non-hedgeable risk
factors, is represented by $W \le 0$. The latter is a bounded random variable defined on a standard,
non-atomic probability space $(\Omega, \mathcal{F}, P)$. The principal's goal is to lay off parts of her risk with
the agents, whose preferences are mean-variance. The agent types are indexed by their coefficients
of risk aversion, which are assumed to lie in $\Theta = [a, 1]$ for some $a > 0$. The principal underwrites
call options on her income with type-dependent strikes:

$$X(\theta) = (|W| - K(\theta))^+, \qquad 0 \le K(\theta) \le \|W\|_\infty.$$

The principal evaluates her risk via the "entropic measure" of her position, i.e. via $\rho$,
where $\rho(X) = \log E[\exp\{-\beta X\}]$ for some $\beta > 0$. The principal's problem is to devise a catalogue
so as to minimize her risk exposure. Namely, she chooses a function $v$ and contracts $X$ from the set

$$\{(X, v) \mid v \in \mathcal{C},\ v \le K_1,\ -\mathrm{Var}[(|W| - K(\theta))^+] = v'(\theta),\ |v'| \le K_2,\ 0 \le K(\theta) \le \|W\|_\infty\}$$

in order to minimize

$$\rho\left(W - \int_\Theta \left\{(|W| - F(v'(\theta)))^+ - E[(|W| - F(v'(\theta)))^+]\right\} d\theta\right) - I(v),$$

where

$$I(v) = \int_\Theta \big(\theta v'(\theta) - v(\theta)\big)\,d\theta.$$







We assume the set of states of the world is finite with cardinality $m$. Each possible state $\omega_j$
can occur with probability $p_j$. The realizations of the principal's wealth are denoted by $W =
(W_1, \dots, W_m)$. Note that $p$ and $W$ are treated as known data. The objective function of our
non-linear program is



F(v,v',K) = log^p^-^WiPi + ^^^TiKj-lW^Pi 

+ - Vi - 6^ 

where $K = (K_1, \dots, K_n)$ denotes the vector of type-dependent strikes. We denote by $n_g$ the total
number of constraints. The principal's problem is to find

$$\min_{(v, v', K)} F(v, v', K) \quad \text{subject to} \quad G(v, v', K) \le 0,$$

where $G : \mathbb{R}^{3n} \to \mathbb{R}^{n_g}$ determines the constraints that keep $(v, v', K)$ within the set of feasible
contracts. Let $(1/6, 2/6, \dots, 1)$ be the uniformly distributed agent types, and

• $W = 4 \cdot (-2, -1.7, -1.4, -0.7, -0.5, 0)$,

• $P = (1/10, 1.5/10, 2.5/10, 2.5/10, 1.5/10, 1/10)$.
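The basic ingredients of this finite-state example (the entropic risk of a position, the call payoffs $(|W| - K)^+$ and their variance) can be sketched as follows. This is an illustration under explicit assumptions and not the Tomlab program used for the results: the value of `beta` is hypothetical because the text does not report it, the third entry of $W$ is taken to be negative so that $W \le 0$, and the helper names are introduced here.

```python
import math

def entropic_risk(position, probs, beta):
    """rho(X) = log E[exp(-beta * X)] over finitely many states."""
    return math.log(sum(p * math.exp(-beta * x) for x, p in zip(position, probs)))

def call_payoffs(wealth, strike):
    """State-by-state payoff (|W| - K)^+ of a call on |W| with strike K."""
    return [max(abs(w) - strike, 0.0) for w in wealth]

def variance(values, probs):
    """Variance of a random variable taking `values` with probabilities `probs`."""
    mean = sum(p * x for x, p in zip(values, probs))
    return sum(p * (x - mean) ** 2 for x, p in zip(values, probs))

W = [4 * w for w in (-2, -1.7, -1.4, -0.7, -0.5, 0)]  # third entry assumed negative
P = [0.10, 0.15, 0.25, 0.25, 0.15, 0.10]
beta = 0.5                                             # hypothetical value
risk_before = entropic_risk(W, P, beta)
```

For a given strike $K_j$, `variance(call_payoffs(W, K_j), P)` is the quantity that the constraint $-\mathrm{Var}[(|W| - K(\theta))^+] = v'(\theta)$ ties to the slope of $v$ at the corresponding type. A strike equal to the largest value of $|W|$ gives a payoff that is identically zero and therefore zero variance.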

The principal's initial evaluation of her risk is 1.52. The following are the plots for the
approximating $v$ and the strikes:

Note that for illustration purposes we have changed the scale for the agent types in the second
plot. The interpolates of the approximation to the optimal function $v$ and the strikes are:

$K_5$: 0.713309

(a) The optimal function v. (b) The type-dependent strikes.

Figure 4: Optimal solution for underwriting call options.



The Principal's valuation of her risk after the exchanges with the agents decreases from 11.49 to 
-3.56. 

Remark 4.3 Notice the "bunching" at the bottom. 

5 Conclusions 

In this paper we have developed a numerical algorithm to estimate the minimizers of variational
problems with convexity constraints, with our main motivation stemming from Economics and
Finance. Ours is an internal method, so at each precision level the approximate minimizers
lie within the acceptable set of functions. Our examples are developed over one- or two-dimensional
sets for illustration purposes, but the algorithm can be implemented in higher dimensions.
However, it must be mentioned that, as is the case with the other methods found in the related
literature, implementing convexity has a high computational cost, which increases geometrically
with dimension.

6 Appendix 

A Some technical results 

In order to prove convergence of our algorithm we make use of the Convex Analysis results
contained in this section. We will work on $C$, an open and convex subset of $\mathbb{R}^n$.

Definition A.1 A mapping $F : C \to P(\mathbb{R}^n)$ (the power set of $\mathbb{R}^n$) is said to be set valued if
for each $x$ in $C$, $F(x)$ is a non-empty subset of $\mathbb{R}^n$.

Recall that if $f : C \to \mathbb{R}$ is a convex function, then the subdifferential of $f$ at $x$, defined as

$$\partial f(x) := \{b \in \mathbb{R}^n \mid f(y) - f(x) \ge b \cdot (y - x) \text{ for all } y \in C\},$$

is a non-empty subset of $\mathbb{R}^n$ for all $x$ in $C$. Therefore, the set valued mapping

$$x \mapsto \partial f(x)$$

is well defined on $C$. Notice that if for some $x$ in $C$ we have $\#\partial f(x) = 1$, then

$$\partial f(x) = \{\nabla f(x)\}.$$

In such case we say the subdifferential mapping is single valued at $x$ and we can simply
identify it with the gradient of $f$ at $x$.

Definition A.2 Let $F : C \to \mathbb{R}^m$ be a set valued function. Then we say $F$ is differentiable
at $x_0$ iff there is a linear mapping $L_{x_0} : C \to \mathbb{R}^m$ such that for all $\epsilon > 0$ there is $\delta > 0$ such that
if $y_0 \in F(x_0)$, $y \in F(x)$ and $\|x - x_0\| < \delta$ then

$$\frac{\|y - y_0 - L_{x_0}(x - x_0)\|}{\|x - x_0\|} < \epsilon.$$

The following theorem is due to Alexandrov ([12]).

Theorem A.3 Let $f : \mathbb{R}^n \to \mathbb{R}$ be convex, then the set valued function $\partial f$ is differentiable almost
everywhere.

Clearly, in the case where $F$ is single valued, Definition A.2 is equivalent to the regular definition
of a Frechet differentiable function. Moreover, if $F$ is differentiable at $x_0$ and we choose $y_0, y_1$ in
$F(x_0)$ and let $x = x_0$ in Definition A.2, we get

$$\|y_0 - y_1\| \le 0,$$

which implies $F$ is single valued at $x_0$. It follows from Alexandrov's Theorem and the observation
above that for almost all $\theta \in \Theta$, the set valued mapping $\theta \mapsto \partial f(\theta)$ can be identified with the
single valued assignment $\theta \mapsto \nabla f(\theta)$, and we have the following

Corollary A.4 Let $f : C \to \mathbb{R}$ be convex. Then the mapping

$$\theta \mapsto \nabla f(\theta)$$

is well defined and continuous almost everywhere.

Proposition A.5 Let $A \subset \mathbb{R}^n$ be a convex, open set. Assume the sequence of convex functions
$\{f_k : A \to \mathbb{R}\}$ converges uniformly to $f$, then $\nabla f_k \to \nabla f$ almost everywhere on $A$.
Proof. Denote by $D_i f$ the derivative of $f$ in the direction of $e_i$. The convexity of $f_k$ and $f$
implies the existence of a set $B$, with $\mu(A \setminus B) = 0$, such that the partial derivatives of $f_k$ and $f$
exist and are continuous in $B$. Let $x \in B$. To prove that $D_i f_k(x) \to D_i f(x)$, consider $\eta$ such that
$x + \eta e_i \in A$. Since $f_k$ is convex

$$\frac{f_k(x + h e_i) - f_k(x)}{h} \ge D_i f_k(x) \ge \frac{f_k(x - h e_i) - f_k(x)}{-h}$$

for all $0 < h < \eta$. Hence

$$\frac{f_k(x + h e_i) - f_k(x)}{h} - D_i f(x) \ge D_i f_k(x) - D_i f(x) \ge \frac{f_k(x - h e_i) - f_k(x)}{-h} - D_i f(x).$$

The left-hand side of this inequality is equal to

$$\frac{f_k(x + h e_i) - f(x + h e_i)}{h} + \frac{f(x) - f_k(x)}{h} + \frac{f(x + h e_i) - f(x)}{h} - D_i f(x).$$

For $\epsilon > 0$ let $0 < \delta < \eta$ be such that

$$\left| \frac{f(x + h e_i) - f(x)}{h} - D_i f(x) \right| < \epsilon$$

for $|h| \le \delta$. Let $N \in \mathbb{N}$ be such that

$$-\epsilon\delta \le f_n(x) - f(x) \le \epsilon\delta$$

for $n \ge N$. Hence, taking $h = \delta$, we have that for all $n \ge N$,

$$\frac{f_n(x + h e_i) - f(x + h e_i)}{h} \le \epsilon, \qquad \frac{f(x) - f_n(x)}{h} \le \epsilon.$$

Hence

$$3\epsilon \ge D_i f_n(x) - D_i f(x)$$

for all $x \in B$. The same argument shows that

$$-3\epsilon \le D_i f_n(x) - D_i f(x)$$

for all $n \ge N$ and all $x \in B$, which concludes the proof. □
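A one-dimensional example makes the statement of the proposition tangible. The sketch below is an illustration chosen here and not taken from the paper: the convex functions $f_k(x) = \sqrt{x^2 + 1/k}$ converge uniformly to $f(x) = |x|$ on $[-1, 1]$, and their derivatives converge to the derivative of $f$ at every point where the latter exists.

```python
import math

def f_k(x, k):
    """Smooth convex approximation of |x|; its sup-distance to |x| is 1/sqrt(k)."""
    return math.sqrt(x * x + 1.0 / k)

def df_k(x, k):
    """Derivative of f_k."""
    return x / math.sqrt(x * x + 1.0 / k)

def sup_gap(k, samples=2000):
    """Largest gap between f_k and |x| on a grid of [-1, 1]."""
    grid = [-1.0 + 2.0 * s / samples for s in range(samples + 1)]
    return max(f_k(x, k) - abs(x) for x in grid)

# Gap between the derivative of f_k and that of |x| at x = 0.5.
gradient_gaps = [abs(df_k(0.5, k) - 1.0) for k in (10, 100, 1000, 10000)]
```

At $x = 0$ the limit function is not differentiable and $f_k'(0) = 0$ for every $k$, which is consistent with the convergence holding only almost everywhere.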

Proposition A.6 Let $U \subset \mathbb{R}^n$ be a convex, compact set and let $g : U \to \mathbb{R}$ be a convex function
such that for all $x \in U$, the subdifferentials $\partial g(x)$ are contained in $Q$ for some compact set $Q$.
Then there exists $\{g_j : U \to \mathbb{R}\}$ such that $g_j \in C^1(U)$ and $g_j \to g$ uniformly on $U$.

Proof. Fix $\delta > 0$ and define

$$U_\delta := \{(1 + \delta)x \mid x \in U\}.$$

Extend $g$ to be defined on $U_\delta$. Let $K_\epsilon$ be a family of mollifiers (see, for instance, [13]), then the
functions

$$h_\epsilon := g * K_\epsilon$$

are convex, smooth and they converge uniformly to $g$ on $U$ as long as $\epsilon$ is small enough so that

$$U_\epsilon := \{x \in U_\delta \mid d(x, \partial U_\delta) > \epsilon\}$$

is contained in $U$. Let $n \in \mathbb{N}$ be such that $U_{1/n} \subset U$, then the sequence $\{g_j := h_{1/j}\}$ has the
required properties. □

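Lemma A.7 Consider $\phi(\theta, z, p) \in C^1(\Theta \times \mathbb{R} \times Q \to \mathbb{R})$, where $\Theta = [a, b]^n$ and $Q$ is a compact
convex subset of $\mathbb{R}^n$. Let $\{f_k : \Theta \to \mathbb{R}\}$ be a family of convex functions such that $\partial f_k(\theta) \subset Q$ for
all $\theta \in \Theta$, and whose uniform limit is $f$. Let $\Sigma_k$ be the uniform partition of $\Theta$ consisting of $k^n$
cubes of volume $\|\Sigma_k\| := \left(\frac{b-a}{k}\right)^n$. Denote by $\sigma_j^k$, $1 \le j \le k^n$, the elements of $\Sigma_k$ and let

$$\|\Sigma_k\| \sum_{j=1}^{k^n} \phi\big(\theta_j^k, f_k(\theta_j^k), \nabla f_k(\theta_j^k)\big)$$

be the corresponding Riemann sum approximating $\int_\Theta \phi(\theta, f_k(\theta), \nabla f_k(\theta))\,d\theta$, where $\theta_j^k \in \sigma_j^k$ and
$\sigma_j^k \in \Sigma_k$. Then for any $\epsilon > 0$ there is $K \in \mathbb{N}$ such that

$$\left| \int_\Theta \phi(\theta, f_k(\theta), \nabla f_k(\theta))\,d\theta - \|\Sigma_k\| \sum_{j=1}^{k^n} \phi\big(\theta_j^k, f_k(\theta_j^k), \nabla f_k(\theta_j^k)\big) \right| \le \epsilon \tag{19}$$

for any $k \ge K$.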



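Proof. By Proposition A.6, for each $f_k$ there exists a sequence of continuously differentiable convex
functions $\{g_j^k\}$ such that

$$g_j^k \to f_k \quad \text{uniformly}.$$

Let $h_k$ be the first element in $\{g_j^k\}$ such that $\|h_k - f_k\| \le \frac{1}{k}$ and $\|\nabla h_k(\theta) - \nabla f_k(\theta)\| \le \frac{1}{k}$ for
all $\theta \in \Theta$ where $\nabla f_k$ is continuous. Then $h_k \to f$ uniformly, and by Proposition A.5 we have that
$\nabla h_k(\theta) \to \nabla f(\theta)$ a.e. It follows from Egoroff's theorem that for every $n \in \mathbb{N}$ there exists a set
$A_n \subset \Theta$ such that:

$$\mu(\Theta \setminus A_n) < 1/n \quad \text{and} \quad \nabla h_k \to \nabla f \text{ uniformly on } A_n.$$

Let $\chi_{\sigma_j^k}(\cdot)$ be the indicator function of $\sigma_j^k$ and define

$$g_k(\theta) := \phi(\theta, h_k(\theta), \nabla h_k(\theta)) - \sum_{j=1}^{k^n} \chi_{\sigma_j^k}(\theta)\, \phi\big(\theta_j^k, h_k(\theta_j^k), \nabla h_k(\theta_j^k)\big).$$

Fix $n$, consider $\theta_0 \in A_n$ and let $\{\theta_0^k\}$ be the sequence of $\theta_j^k$'s converging to $\theta_0$ as the partition is
refined. By uniform convergence, $\nabla f$ is continuous on $A_n$, hence

$$\lim_{k \to \infty} h_k(\theta_0^k) = f(\theta_0) \quad \text{and} \quad \lim_{k \to \infty} \nabla h_k(\theta_0^k) = \nabla f(\theta_0) \tag{20}$$

It follows from (20) and the continuity of $\phi$ that $g_k \to 0$ almost everywhere on $A_n$. Notice that as
a consequence of the compactness of $\Theta$ and $Q$ and the definition of $h_k$ we have

$$\|\phi(\theta, f_k(\theta), \nabla f_k(\theta))\| \le K_1 \quad \text{for all } \theta \in \Theta \text{ and some } K_1 > 0,$$

and

$$\left| g_k(\theta) - \Big( \phi(\theta, f_k(\theta), \nabla f_k(\theta)) - \sum_{j=1}^{k^n} \chi_{\sigma_j^k}(\theta)\, \phi\big(\theta_j^k, f_k(\theta_j^k), \nabla f_k(\theta_j^k)\big) \Big) \right| \le \frac{K_2}{k}$$

for some $K_2 > 0$ and all $\theta \in \Theta$ where $\nabla f_k$ is continuous. Therefore

$$\left| \int_\Theta \phi(\theta, f_k(\theta), \nabla f_k(\theta))\,d\theta - \|\Sigma_k\| \sum_{j=1}^{k^n} \phi\big(\theta_j^k, f_k(\theta_j^k), \nabla f_k(\theta_j^k)\big) \right| \le \frac{K_2 |\Theta|}{k} + \left| \int_\Theta g_k(\theta)\,d\theta \right|.$$

By Lebesgue Dominated Convergence

$$\lim_{k \to \infty} \int_{A_n} g_k(\theta)\,d\theta = 0,$$

moreover, the definition of $A_n$ implies

$$\left| \int_{\Theta \setminus A_n} g_k(\theta)\,d\theta \right| \le \frac{2K_1}{n}.$$

Given $\epsilon > 0$ take $n \in \mathbb{N}$ such that $\frac{2K_1}{n} < \frac{\epsilon}{2}$ and $K$ such that

$$\frac{K_2 |\Theta|}{K} + \left| \int_{A_n} g_K(\theta)\,d\theta \right| \le \frac{\epsilon}{2} \tag{21}$$

Then equation (19) holds for all $k \ge K$.

□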



B MatLab code for the examples in Section 4